Robust smoothing of gridded data in one and higher dimensions with missing values

نویسنده

  • Damien Garcia
چکیده

A fully automated smoothing procedure for uniformly-sampled datasets is described. The algorithm, based on a penalized least squares method, allows fast smoothing of data in one and higher dimensions by means of the discrete cosine transform. Automatic choice of the amount of smoothing is carried out by minimizing the generalized cross-validation score. An iteratively weighted robust version of the algorithm is proposed to deal with occurrences of missing and outlying values. Simplified Matlab codes with typical examples in one to three dimensions are provided. A complete user-friendly Matlab program is also supplied. The proposed algorithm - very fast, automatic, robust and requiring low storage -provides an efficient smoother for numerous applications in the area of data analysis.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Corrigendum to "Robust smoothing of gridded data in one and higher dimensions with missing values" [Comput. Statist. Data Anal. 54 (2010) 1167-1178]

On page 1170, Eq. (14) converges, for any initial conditions, if the matrix A is positive definite. In the original paper, it was asserted that D is nonsingular. This is obviously wrong since one eigenvalue is zero (see Eq. (8)). Thus the positive definiteness of A still remains to be proved. We assume that the non negative weights wi are not identically zero. By definition, A = sDTD + W and s ...

متن کامل

A method to solve the problem of missing data, outlier data and noisy data in order to improve the performance of human and information interaction

Abstract Purpose: Errors in data collection and failure to pay attention to data that are noisy in the collection process for any reason cause problems in data-based analysis and, as a result, wrong decision-making. Therefore, solving the problem of missing or noisy data before processing and analysis is of vital importance in analytical systems. The purpose of this paper is to provide a metho...

متن کامل

Statistical Characteristics of Daily Precipitation: Comparisons of Gridded and Point Datasets

Gridding of daily precipitation data alleviates many of the limitations of data that are derived from point observations, such as problems associated with missing data and the lack of spatial coverage. As a result, gridded precipitation data can be valuable for applied climatological research and monitoring, but they too have limitations. To understand the limitations of gridded data more fully...

متن کامل

Identification of the most important factors of ethnic differences in anthropometric dimensions of Iranian workers using the decision tree

Background and aims: Anthropometry is the branch of human science that considers the physical measurement of the human body, especially size and shape. One application of anthropometrical data in ergonomics is the design of working space and the development of industrialized products. So that the tools, equipment and workstations, which designed based on the physical dimensions of the workers, ...

متن کامل

Performance evaluation of different estimation methods for missing rainfall data

There are numerous methods to estimate missing values of which some are used depending on the data type and regional climatic characteristics. In this research, part of the monthly precipitation data in Sarab synoptic station, east Azerbaijan province, Iran was randomly considered missing values. In order to study the effectiveness of various methods to estimate missing data, by seven classic s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computational statistics & data analysis

دوره 54 4  شماره 

صفحات  -

تاریخ انتشار 2010